Journal article

Approximate clustering in very large relational data

JC Bezdek, RJ Hathaway, JM Huband, C Leckie, R Kotagiri

International Journal of Intelligent Systems | Published : 2006

Abstract

Different extensions of fuzzy c-means (FCM) clustering have been developed to approximate FCM clustering in very large (unloadable) image (eFFCM) and object vector (geFFCM) data. Both extensions share three phases: (1) progressive sampling of the VL data, terminated when a sample passes a statistical goodness of fit test; (2) clustering with (literal or exact) FCM; and (3) noniterative extension of the literal clusters to the remainder of the data set. This article presents a comparable method for the remaining case of interest, namely, clustering in VL relational data. We will propose and discuss each of the four phases of eNERF and our algorithm for this last case: (1) finding distinguishe..

View full abstract

University of Melbourne Researchers